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Abstract. We report model calculations on DNA single strands which describe the equilibrium dynamics 
and kinetics of hairpin formation and melting. Modeling is at the level of single bases. Strand rigidity is 
described in terms of simple polymer models; alternative calculations performed using the freely rotat- 
ing chain and the discrete Kratky-Porod models are reported. Stem formation is modeled according to 
the Peyrard-Bishop-Dauxois Hamiltonian. The kinetics of opening and closing is described in terms of a 
diffusion-controlled motion in an effective free energy landscape. Melting profiles, dependence of melting 
temperature on loop length, and kinetic time scales are in semiquantitative agreement with experimental 
data obtained from fluorescent DNA beacons forming poly(T) loops. Variation in strand rigidity is not 
sufBcient to account for the large activation enthalpy of closing and the strong loop length dependence 
observed in hairpins forming poly(A) loops. Implications for modeling single strands of DNA or RNA are 
discussed. 

PACS. 87.15. He Dynamics and conformational changes - 87.15.Aa Theory and modeling; computer sim- 
ulation - 87.14.Gg DNA, RNA - 36.20.Ey Conformation (statistics and dynamics) 



1 Introduction 

' DNA beacons are made of short single strands of DNA 
I with terminal regions consisting of complementary base 
' sequences. As a result the two end-regions can self-assemble 
in a short DNA double helix, called the stem, while the 
, remaining central part of the strand forms a loop. In this 
closed configuration, the single strand has the shape of a 
hairpin. Such hairpin conformations are present in the sec- 
i ondary structure of long single strands of DNA or RNA. 
' A short single strand of DNA which can form a hairpin 
becomes a so-called "DNA beacon" when one of its ends is 
attached to a fluorophore while the second end is attached 
to a quencher. When the fluorophore and the quencher are 
within a few Angstroms of each other, the fluorescence is 
suppressed due to direct energy transfer from the fluo- 
rophore to the quencher. Consequently in a closed hairpin 
configuration the beacon is not fluorescent, whereas in the 
open configuration it becomes fluorescent. This property 
leads to many interesting applications for molecular bea- 
cons in biology or physics. 

Biological applications use the possible assembly of a 
portion of the single strand which forms the loop with an- 
other DNA strand which is complementary to the loop's 
sequence. The assembly of the single strand of the loop 
with another strand to make a double helix is only pos- 
sible when the hairpin is open because double-stranded 



DNA is very rigid. Therefore, when the assembly occurs, 
the fluorescent signal is restored Ij. This technique pro- 
vides very sensitive probes of the sequences which are com- 
plementary to the loop. Using this idea, it has been sug- 
gested that DNA beacons could be used in vivo to detect 
the single stranded RNA which is synthesized during the 
transcription of genes. This could allow the recognition 
of cancer cells by targeting some genes which are heavily 
transcribed in those cells [11|3]. 

Physical applications exploit the high reproducibility 
of the hairpins' self-assembly process which makes it pos- 
sible to build molecular memories read by detecting the 
fluorescence [3] or devices capable of performing molecular 
computation [5]. 

Understanding the DNA hairpin self-assembly process 
at the mesoscopic scale is possible because molecular bea- 
cons allow accurate monitoring of the opening and closing 
steps. The "melting profile" of the stem, induced by heat- 
ing, can be recorded accurately versus temperature and 
the autocorrelation function of the fluorescence can be 
used to extract the kinetics of the opening/closing fluc- 
tuations. Measurements have been made [B] for different 
loop lengths and different bases in the loop. They pro- 
vide a complete set of data which can be compared to 
the results of a theoretical analysis in order to determine 
the basic mechanisms controlling the properties of DNA 
hairpins. This is the primary aim of the study presented 
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here. It should be noted however that our results, because 
of their strong sensitivity to the properties of the loop, 
turn out to have imphcations which extend beyond the 
properties of hairpins as such. The detailed comparison of 
experimental data with the results of various loop models 
enhances our ability to model single strands of DNA and 
RNA. 



2 The model 

The closing of a DNA hairpin has some similarity with 
the folding of a protein in the sense that it is an evolu- 
tion from a random chain to a geometrical shape which is 
stabilized by weak bonds established between some of its 
components, here the bases of the stem. The full process is 
quite complex because it involves the precise positioning 
of a large number of atoms in space to form the strands of 
the stem. However one may reasonably argue that, in or- 
der to understand experimental observations such as the 
fluctuations of a beacon, one does not need to know all 
the details of the process. A simple view is to consider 
the DNA strand as a polymer chain. Then it should be 
possible to combine known models for the stem with a 
polymer model for the loop. This has been done in an ap- 
proach that uses the simplest possible model for the stem 
[7j, an Ising model in which the bases are either closed or 
open, and a semiflexible polymer model for the loop. This 
approach gave interesting results, in particular concerning 
the estimation of the persistence length of single-stranded 
DNA. However it has the drawback that the description 
of the stem is very rough and relies on empirical param- 
eters, such as the entropy change involved in the closing 
of two bases, which cannot be justified within the model 
and have to be fitted. Moreover, as the Ising model of 
the stem ignores all geometrical parameters, such as the 
distance between the bases linked to the loop, the match- 
ing between the models of the stem and the loop has to be 
crude. A further aspect which is not satisfactory in such an 
approach is that it uses two different models for the stem 
and the loop while both belong to the same DNA single 
strand, and should be described in the same framework. 
This is what we are doing in the present study. It should of 
course be clear that the pairing of bases in the stem leads 
to additional phenomena which do not occur in the loop 
and must be taken into account in order to complete the 
model. Last, but by no means least, we would like to argue 
that a model with continuous degrees of freedom is more 
apt to describe the end-to-end distance, which is a nat- 
ural "reaction coordinate" measured by the fluorescence 
signal. 

A schematic picture of our hairpin model is shown in 
Fig. [TJ It consists of a stem of M base pairs and a loop 
with L segments, i.e. L—1 bases so that the single strand 
which forms the hairpin has a total of 2M + L — 1 bases 
or N ~ 2M + L — 2 segments. This single strand can be 
described by different polymer models. The dependence 
of our results on the particulars of the polymer model 
will be discussed in Sec. For the moment, let us con- 
sider only one of them as the basic model of our study, the 




Fig. 1. A schematic picture of the model to define some no- 
tations. The hairpin is made of a stem of M base pairs and a 
loop of L segments, i.e. L — 1 bases. The bases along the strand 
are labeled by an index i ranging from 1 to L — 1 + 2M. The 
variables pm represent the distances between the bases form- 
ing base pairs m in the stem. The stretching of the base pair 
distance is denoted by ym so that pm = j/m + d, where d is the 
equilibrium distance between the bases in a DNA double helix. 



Kratky-Porod (KP) j8j model, also known as the wormlike 
chain (WLC) in its continuum version [T^] ; for DNA hair- 
pins which have short loops and a very short persistence 
length the original discrete version is more appropriate. 
The Kratky-Porod model considers the chain of bases as 
made of rigid segments of length £. The orientation of a 
segment in three-dimensional space is defined by a vec- 
tor Ri of unit length, lying along segment i, as shown in 
Fig. [TJ Therefore the end-to-end distance of this chain, 
labeled pi in Fig. [1] is 



Pi 



N 



(1) 



The energy of the KP model is 



Ei{N)^-eeY.{R,-R,+i-l) 



(2) 



where e^^ measures the energy that is necessary to bend 
the polymer at a joint, i.e. it is a parameter that measures 
the rigidity of the strand. 

To complete the description of the hairpin we must 
also add the interactions which may take place within the 
stem when base pairing occurs. We use an approach based 
on the PBD model for DNA melting [QllTO] by adding to 
the polymer model the energy contribution 
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where -D, a, K, Q are constant parameters and ym = Pm~d 
denotes the deviation of the distance pm between two 
bases in pair m from its equilibrium value, d, in the dou- 
ble helix. In other words ym is the stretching of the m**^ 
base pair in the stem, and is a function of the vectors Ri 
which define the geometrical shape of the strand. The po- 
tential energy of the stem includes Morse potentials V(y) 
describing the pairing energy between two complementary 
bases. The Morse potentials describe an effective inter- 
action which includes the attractive contribution of the 
hydrogen bonds between the complementary bases and 
the repulsion coming from the charged phosphate groups 
on the strands. The other important energy terms in the 
stem are the stacking interactions between consecutive 
bases, described by the nonlinear potential W{y,y'). In 
fact stacking energies are also present in an implicit form 
in the polymer model of the strands since the flexibility 
of a singe strand of DNA is affected by the interactions 
between the bases which are part of the nucleotides. In 
the stem however the stacking energy increases because of 
the geometrical constraints of double helix packing. The 
base pair plateaux are piled on top of each other and in- 
teract strongly due to the overlap of their n electrons. 
If one of the two adjacent pairs is open the double helix 
packing disappears and the prefactor exp[— C(2/m + 2/m-i)] 
vanishes [TT] . It is the geometrical constraint which allows 
us to use the scalar variable y„i to describe the base pair 
status in the stem. In this geometry the displacement of 
the bases is essentially orthogonal to the stem axis and 
therefore the stretching of the individual base pairs pro- 
vides a mesoscopically acceptable description of the stem's 
state. 

Our choice of E2 (M) is based on the PBD model which 
has been widely tested for DNA melting [9lfT0] but other 
expressions are certainly possible, provided they properly 
describe the physics of the molecule. The potential be- 
tween the bases has to include a stong repulsion when the 
bases approach each other (y™ < 0) and the force has to 
tend to zero (constant potential) at large ym- The Morse 
potential has the proper qualitative shape. Similarly ex- 
pressing W{ym,ym-i) by a harmonic interaction with an 
effective coupling constant K ex.p[—(^(ym+ym-i)] is a sim- 
ple way to describe the decay of the stacking which is 
expected when base pairs open. Studies of variants of the 
PDB model show that different expressions preserving the 
same qualitative properties imposed by the physical con- 
straints lead to quantitative changes in the results which 
are also obtainable - within the accuracy of experimental 
observations - by varying the PDB model parameters. 

It should be noted that expression Q imposes a pri- 
ori the bases which can be linked by a pairing potential. 
In other words it assumes that the base sequences in the 
terminal regions of the strand are such as to guarantee 
full pairing in the closed hairpin. Mismatches are thus not 
allowed in the model. We do not expect them to play a sig- 
nificant role in the physical system, at least in the case of 
the short stems under consideration, since the relative en- 
ergetic cost of a mismatched configuration would be high. 



The potential energy of the hairpin is _E = Ei{N) 
E2{M). 



3 Thermodynamic properties 

3.1 Constrained partition function and free energy: 
principle of the derivation. 

The fiuorescence of DNA beacons is determined by the 
distance pi between the two ends of a strand, which carry 
the fluorophore and the quencher. In order to analyze the 
experiments we must therefore determine the probability 
VNipi)dpi that a strand will have an end-to-end distance 
in the interval -I- dpi); this is - within a normal- 

ization factor - identical to the constrained configuration 
partition function 



2Nipi) = UdrN 6 



N 



N 



pi 



-0E{r«) 



(4) 



obtained by integrating the Boltzmann weight over the 
configuration variables symbolically denoted by Iat for a 
DNA strand of N monomers under the constraint of fixed 
end-to-end distance imposed by the Dirac delta function. 
The normalized probability density function 



Vnipi 



Zn{pi 



(5) 



is obtained by dividing (j4]) by the unconstrained partition 
function 



N 



(6) 



The calculation of Z{pi) for a hairpin is complicated 
by the presence of interactions within the stem because 
they involve the relative ■positions of two segments of the 
polymer. In order to proceed let us start from the L seg- 
ments forming the loop. Their study is simpler because 
they form an ordinary polymer, which, in our case, is 
described by the KP model. The configuration partition 
function of the loop is and the probability that the 
two ends of the loop are at distance pM, the distance be- 
tween the two bases at the end of the stem connected to 
the loop, is 

Pl{pm) = Zl{pm)/ZI. (7) 

In contrast to the expressions of Eq. ([5]) which refer to 
the full hairpin, the corresponding terms in Eq. ([7]) refer 
to an ordinary polymer without the additional constraints 
imposed by the stem. Their derivation is discussed in the 
subsection 13.21 To stress this distinction we have used a 
script notation Z, V for quantities that cannot be ob- 
tained from standard polymer theory. 

Now that the loop is characterized, let us derive the 
partition function of the hairpin by successively adding 
the segments which form the stem, one segment at a time. 
If we start from the loop and add one monomer at each 
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end, we have built one segment of the stem. The distance 
between the new ends of the strand is now pM~i- Using 
the notation of Eqs. (O or (O, for this extended polymer 
consisting of L + 2 monomers, we have 



(8) 



In order to evaluate Vl+2{pm-i), let us introduce a condi- 
tional probability S{p'\p) that, if a polymer of p monomers 
has its ends at distance p, a polymer of p + 2 monomers, 
obtained by adding one monomer at each end of the pre- 
vious one, has the distance p' between its ends. For the 
polymer alone i.e. without the contribution of the energy 
E2 in the stem, this conditional probability is such that 



Pp+2{p') 



s{p'\p) PM dp . 



(9) 



In the presence of the energy terms E2 within the stem, the 
conditional probability S{p'\p), determined by the prop- 
erties of the polymer alone, must be corrected by a Boltz- 
mann factor containing the potential energy terms V{p)-\' 
W{p', p) due to base pairing and stacking interactions. Ac- 
cordingly, the hairpin with a single pair of complementary 
bases will satisfy 

/>CXD 

Vl+2{pm-i) = / dpM e-^[^(''-)+^(P--i^P-)l 

X S{pm~i\pm) Pl{pm) ■ (10) 

The process can be iterated to add the remaining segments 
of the stem. The advantage of this progressive buildup 
of the stem is that it explicitly introduces the distances 
between the bases that pair in the stem in the calculation, 
allowing us to include the proper statistical weights arising 
from pairing and stacking energies in the stem. 

Once all the stem segments and stem energy terms 
have been included we obtain 



Zn{pi) ^ Z% Jdp2. ■■ J 



dpM e 



-/3\/(pi) 



X g-P[V{p2) + W(p^,p2)] X ^ ^ 

-I3[V(pm) + W(pm-i-Pm)] 



X e 

X S{pi\p2) ...S{pm-i\pm) 

X PLipAl) ■ (11) 

This expression gives the constrained partition function 
of the hairpin in terms of properties of the polymer form- 
ing the strand, Pl{p) and S{p'\p). It is therefore vahd for 
any polymer model, provided one can derive these two 
probability distributions for the model of interest. The 
constrained partition function Zn {pi ) defines an effective 
free energy J-{pi) = —kBT\nZM{pi) for the hairpin hav- 
ing the distance pi between its ends, i.e. it gives the free 
energy landscape using pi as the relevant coordinate. The 
result appears as a [M — l)-dimensional integral but - like 
any transfer integral - it can actually be computed by a se- 
quence of M — 1 one-dimensional integrations. Performing 
first the integration over pM^ we get a function of pm-i', 



next, the integration over pM-i gives a function of pm-2, 
and so on, until the last integration over p2 which gives 
the desired constrained partition function. Therefore, the 
calculation of 2'jv(pi) is a relatively straightforward nu- 
merical task since Pl{p) and S{p'\p) can be derived from 
appropriate polymer models. 



3.2 The properties of the Kratky-Porod model, and 
the effective Gaussian approximation 

In order to proceed further with the calculation of J'{pi) 
we need expressions of Pl{p) and S{p'\p) for the polymer 
model chosen to describe the DNA strand, i.e. the Kratky 
Porod (KP) model having an energy given by Eq. This 
model has been widely studied in the continuum limit, 
known as wormlike chain (WLC) where the energy tends 
to 

(12) 



.A 


dR 


1 dx 


'0 


dx 



^1-2 



for a given chain length A = L£, in the limit L — s- cx), 
£ — > 0, provided ei'^ k, the continuum chain stiffness. 

However, the probability distribution function P{p) 
obtained in the continuum limit [T^l[T51fT^l[T5] is not ap- 
propriate for DNA hairpins for which the loops may not 
be longer than a few persistence lengths of single-stranded 
DNA and the persistence length itself hardly exceeds the 
monomer distance. In this case the continuum limit be- 
comes a priori questionable and the discrete expression of 
Eq. ([2]) should be preserved. The partition function of 
a polymer of L segments is readily calculated as 



d[2i . . . dflL^ 



Att [47re ^io 



1 i-i 



(13) 

where h = f3ei^ and io{b) = sinh(6)/6 is the modified 
Bessel function of zeroth order. The mathematical equiva- 
lence of this model with the classical Heisenberg ferromag- 
netic chain |16j can be used to show that the orientational 
correlations between different segments have the form 



(Rr ■ Rs] 

with a persistence length 

f 

A 



\r-s\£/\ 



ln[ii{b)/io{b)] ln[coth(6) - l/b] 



(14) 



(15) 



where ii{b) — [fecosh(6) — sinh(6)]/&^ is the modified Bessel 
function of first order. For the discrete KP model, the end- 
to-end distribution function can be computed numerically 
from its Fourier transform, which can be expressed [17j as 
the leading matrix element 



(16) 



of the A^th power of a symmetric matrix F whose elements 
are given by 
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Fii'iq) - \ [{2l + l){2l' + l)li{h)iv{b) 



1/2 



E (2fc + i)H) 

k=\l~l' \ ,k+l+l' =2r 

^{r - k)1'{r - l)1'{r - V) 



1 



where 



•f (n) = 



tf(r) 



r(n + i)r(i) 



r + l/2 
3k{q) , (17) 



n 



1 — 

2j 



jk{q) is the spherical Bessel function of fcth order, and 
ii{b) = ii{b)/io{b). In practice one can obtain numerically 
accurate results even for short stiff polymers (L = 10, A = 
0.8L£ for instance) by summing no more than 8 terms in 
(|17p . Furthermore, since only the leading matrix element 
is required, direct matrix multiplication is quite efficient. 

The derivation of the conditional probability S{p'\p) 
is even more demanding than the calculation of P{p) and 
we have not been able to obtain it for the KP model. For- 
tunately however, in the case of weak chain rigidity, there 
is a way to go around this difficulty because, as shown 
in appendix lAl the conditional probability S{p'\p) can be 
calculated exactly for a Gaussian chain, made of orienta- 
tionally uncorrelated links such that the probability for 
any segment to lie along a vector is proportional to 
exp(— |Z\p/4T^). It is given by 



S{p'\p) = 



1 



^e-(p''V)/8 
P 



(18) 



The Gaussian probability function P'~^ (p) can be used 
to approximate the end-to-end distribution function of the 
KP chain P^^{p) by choosing a temperature dependent 
value of its parameter cr^ = Lr^ so that the average square 
of the end-to-end distance of a Gaussian chain with L 
segments {p^) = L l'^ = 6cr^ — 6Lt^ matches the average 
value of for the KP chain 



with 



X 



t 



1 + coth(6) - 1/b 
1 -coth(6) + 1/6 



(19) 



(20) 



To get a Gaussian approximation for the KP chain we 
must therefore select 



X 
6 



(21) 



Figure [2] shows that the Gaussian approximation is fairly 
good for L — 24 and becomes poor for L = 14. How- 
ever we do not actually need to use P'^{p). The advan- 
tage of the Gaussian approximation is that it provides the 



(xlO-^) 



L = 24 




(xlO-^) 



L = 14 



3 




Fig. 2. Comparison between the Kratky Pored distribution 
function Pj^^ip) (full line) and the Gaussian approximation 
Pl{p) (dashed line) for two values of L. The crosses show 
the distribution function Pl{p) obtained by starting from the 
Kratky Porod distribution Pl-2{p) and computing the proba- 
bility distribution of the end-to-end distance of a polymer ex- 
tended by two units, using the conditional probability S{p'\p) 
according to Eq. [9] The parameters for the KP model are 
e = 0.0016 eV/A^, I — & K, and the calculation has been 
made for T = 300 K, giving b = fief = 2.26 and A = 10.8 A. 



basis for an approximate expression of S{p'\p) given by 
Eqs. (dH]) and (pij) . which can be used in order to com- 
pute Pl+2 from Pi, according to Eq. (fTO|) . by providing 
for Pl the numerical result P^^{p), i.e. a value which 
is very accurate. In this approach the error introduced 
by the Gaussian approximation only affects the variation 
of P{p) when the polymer is extended. Figure [2] shows 
that, even for a short loop L = 14, for which the Gaus- 
sian approximation is poor, the comparison between the 
approximated expression of Pl+2 and the accurate nu- 
merical value P]^^2 quite good, and becomes excellent 
for longer loops (L = 24). This gives us all the ingredi- 
ents that we need to compute the constrained partition 
function of the hairpin Zn{pi) according to Eq. (fTT|) . 
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3.3 First results. 

Before discussing all the results in Sec. [5] it is useful to 
consider an example which illustrates the thermal prop- 
erties of hairpins and introduces some quantities which 
will turn out to be relevant in the next section on kinetic 
properties. 



0.6 
0.5 
0.4 
0.3 
0.2 
0.1 


-0.1 

-0.2 I ' ' ' ' ' ' ' ' ' ' 

20 40 60 80 100 120 140 160 180 200 220 

P 

10 





closed state of the hairpin, while the broad maximum 
at large pi corresponds to the open configurations. This 
shape of Vn{pi) points out that, at any temperature, the 
open and closed forms of the hairpin coexist. The open- 
ing "transition" is only a shift of the equilibrium from one 
temperature regime where the closed configurations dom- 
inate to another where the open states are the majority. 
This is not surprising since even an approximate phase 
transition should not be expected in a small system such 
as a DNA hairpin. Therefore, in order to provide a mea- 
sure of the opening of the hairpin we have to compute 
the fraction of open states, which can be obtained from 
the probabihty distribution Vn{pi) by defining as closed 
the states for which pi < p* and open those for which 
p* < pi < Pmax, where p* is the value of pi corresponding 
to the minimum of Vn{pi) (i-e. the maximum of !F{pi)) 
and Pmax = N^ is the maximum distance between the 
ends of the DNA strand, determined by the length of the 
strand. The respective probabilities to find the hairpin in 
the closed and open configurations are thus 



VN{pi)dpi Po 



VN{pi)dpi . (22) 



Since VNipi) is normalized, i.e. Pc+Po — 1, Po also repre- 
sents the fraction of open configurations at a given temper- 
ature. Performing such a calculation as a function of tem- 
perature gives the so-called "melting curve" of the DNA 
hairpin. Figure 3] shows two examples of such curves for 
L — 24 (the case illustrated in Fig. [3]) and a case with a 
shorter loop (L 14). If we define as the temperature 
at which po — Pc, we get Tm{L = 24) = 317.7 K and 
Tm{L — 14) — 337.0 K for the model parameters that we 
used in these calculations. In the context of this paper we 
will also refer to Tm as the opening temperature of the 
hairpin. 



Fig. 3. Behavior of the hairpin model at two different tem- 
peratures, r = 300 K (thin line) and T = 350 K (thick line), 
for L — 24. Top figure: free energy versus pi, bottom figure: 
probability density P{pi) in logarithmic scale. The parameters 
of the model are D = 0.16 eV, a = 6.9 A'^ K = 0.125 eV/A^ 
C = 0.10 A-\ e = 0.0016 eV/A^ ^ = 6 A. 

Figure [3] shows the effective free energy T[pi) for a 
hairpin with M — h base pairs in the stem and L = 24 
segments in the loop, at 300 K and 350 K, and the cor- 
responding probability distributions Vn{pi)- As shown 
below, these two temperatures are on both sides of the 
opening temperature Tm of this hairpin. However Fig. [3] 
shows that J-^{pi) and VNipi) maintain the same quali- 
tative shape at both temperatures, ^{pi) has a narrow 
well around pi = 10 A which is the equilibrium distance 
between the bases in a DNA double helix; the narrow well 
is separated from a broad secondary minimum at larger 
pi by a fairly sharp maximum. The probability density 
Vn{pi) = Zn{pi)/Z% ^ exp[-/3J^(pi)]/Z^ exhibits two 
peaks. The peak around pi = 10 A corresponds to the 




200 250 300 350 400 



T(K) 

Fig. 4. Variation versus temperature of the percentage of open 
hairpins for two different loop lengths L = 24 (full line) and 
L = 14 (dotted line). The parameters of the model are M = 5, 
D = 0.16 eV, a = 6.9 A'^ K = 0.125, C = 0.10 A'S e = 
0.0016 eV/A^, ^ = 6 A. 
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4 Kinetics 

The derivation of the free energy T{pi) allows us to go 
beyond the analysis of the equilibrium properties of the 
hairpins because it exhibits the characteristic shape of a 
system evolving between 3 states C ^ T* ^ O, the closed 
C and open O states associated, respectively, with the 
minima of ^(pi ) and an unstable transition state T* cor- 
responding to the intermediate maximum. This suggests 
that the multidimensional dynamics of the opening and 
closing of the hairpins can be viewed as a reduced prob- 
lem of reaction kinetics. If the system is strongly coupled 
to its environment, the dynamics of the molecule has no 
memory of its velocity so that it is well described by a 
diffusion on the free energy surface !F{pi). For the large 
molecular units involved in the opening/closing of DNA 
hairpins this is a reasonable assumption. Studying the ki- 
netics of hairpin fluctuations is thus reduced to the cal- 
culation of a first passage time in a diffusion controlled 
process plfTO]. 



-2 



o 



Fig. 5. Schematic of the reaction kinetics view of the hairpin 
opening/closing, and definition of the reaction rate constants 
of the processes involved. 



In the language of chemical reaction kinetics, if we 
denote the concentrations in the three states by pc, p*, 
Po , respectively, and use the kinetic constants defined in 
Fig- El the temporal evolution of the rcactants is described 

by 



dpc 
dt 

dpo 
dt 

dp* 

~dt 



-kip^ + k^ip* (23) 

k2P* - fc_2Po (24) 

fcipc + fc-2Po - (fc-1 + k2)p* . (25) 



Under the standard assumption of rapid intermediate state 
dynamics, there is no variation of the concentration p* on 
the time scale of the diffusive motion which controls bar- 
rier crossing. The condition dp* / dt — implies 



P 



k-i + k2 



(26) 



and allows us to eliminate the concentration of the tran- 
sition state from the equations, leading to 



dpc kik2 

dt fc-i + k2 

dpo _ fc-lfc-2 

dt fc-i + k2 



Pc 



Po 



A:-ifc_2 

k-i + k2 

kik2 



Po = -kfPc 
~Pc = kj,po 



krPo (27) 

kfPc ■ 
(28) 



The equilibrium concentrations Pc and Po satisfy the 
relationship Pc/Po — kr/kf, which allows us to rewrite the 
inverse of the forward and reverse kinetic constants as 



kZ^ 



k^ — k-^ ~\~ _ k 2 

Po 
Po, 



Pc 



(29) 
(30) 



The ratio of equilibrium concentrations is given by the 
ratio of partition functions of the corresponding states 



Pc 
Po 



Zo ' 



(31) 



where 2c and Zo designate the partition function of the 
hairpin restricted to pi < p* or pi > p* respectively. The 
kinetic parameters are now expressed only in terms of 
equilibrium properties and the lifetimes k^^ and kzl of 
the closed and open states, which we must evaluate. 

Each of the two states corresponds to a basin of the 
free energy T[pi), and the lifetime of the closed and open 
states is therefore the first passage time of the coordinate 
pi above the barrier that defines the boundary between 
the two basins for pi = p* . The diffusion on the free energy 
surface is described by the Smoluchowski equation 



dr{pi,t) _ djipi,t) 



dt 



-Do 



dpi 
dV{pi,t) 
dpi 



aF(pi) ,^ 

P— ^ V[pi,t) 

dpi 



(32) 
(33) 



where j{pi,t) is the current of the probability V{pi,t) that 
the distance between the ends of the hairpin is pi at time 
t. 

The diffusion coefficient Dq is determined by the actual 
diffusive mechanism of the elements of the DNA strand in 
the solvent that surrounds the hairpin. It could in principle 
depend on pi , but a reasonable assumption in an ordinary 
solvent is to consider Dq as a constant. Its value sets the 
timescale of the opening/closing of the hairpin. 

The calculation of the first passage time r for Eq. ((33)) 
has been made by Szabo et al. [T9^; an alternative deriva- 
tion, outlined in Appendix [Bl for the sake of completeness, 
has been given by Dcutsch 20J. The result is 



dv 

PO Dopoir) 



with 



I{r) = / dppo{p) , 

J Po 



(34) 



(35) 



where po defines the limit of the basin of interest (po = 
for the basin corresponding to closed hairpins, po ~ Pmax 
for open hairpins) and po(^) is the probability that pi = r 
in the basin of interest, determined by the free energy 
J^{pi) according to 



f.-l3^{r) 

Po{r) = — T- — — 

dpe-l^^ip) 

J Po ' 



(36) 
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From Eq. (|29|) we get 



f Jo Doexp[-PT{r)]/Z, 



dr 



1 



i?oexp[-/3^(r)]/Zo 



Ilir), (37) e 



where we denoted by Icir) and Io{r) the integral ([55]) com- 
puted in the basin for closed or open states respectively. 
To avoid overflows in the calculations it is convenient to 
rewrite those integrals by introducing inside them the fac- 
tor exp[/3J^(r)]. If we define 



) = ^ f dpe-^^^'^P^-^'^-'^^ for r<p* (38) 



1 /-Pn 



J{r) = ^ / dpe-'^^^^P^-^^''^^ for r > p\ 



equation ([37| gives 



d^^e"'5^Wj2(r) 
-Do 



(39) 



(40) 



and an equivalent expression for ^ with Zo can also be 
obtained. 



10" 



10-" 



10^ 



10'' 



10-^ 



10^ 



n 

























2.6 2.8 3 3.2 3.4 3.6 
1000/T(K"^) 

Fig. 6. Opening (open squares) and closing (closed squares) 
reaction rates kj and fcr, in logarithmic scale, versus lOOO/T. 
L = 24. The other model parameters are the same as for Fig.|4l 
The calculation has been made with Do = 3.0 10® cm^/s. The 
full lines show fits by Arrhenius laws with Eo ~ 0.73 eV and 
Ec = 0.15 eV. 



Figure [6] shows the temperature dependence of the 
opening and closing times, kj^ and respectively. The 

values are proportional to Dq^, the inverse of the diffusion 
coefficient introduced in the Smoluchowski equation ([33|) . 
Measurements for single strands of DNA [5T] give diffu- 
sion coefficients of 1.5 10® cm^/s. We have used the value 
Do = 3.0 10® cm^/s, which is a reasonable estimate for 
the shorter pieces of DNA strands involved in the closing 
of the hairpins that we consider. This choice leads to time 




Fig. 7. Variation of the opening temperatures of DNA hair- 
pins deduced from the model where the loop is described by 
a Kratky Porod model for e — 0.0012 eV/A'^ (circles) and 
e = 0.0016 eV/A^ (squares). The curves show fits with the 
function T = Toi". 



scales of kj ^ and fc^ ^ which are in good agreement with 
the experiments [6]. 

Their temperature dependence is well fitted by Arrhe- 
nius laws 



(41) 



Both activation energies are positive in agreement with 
the experimental observations [5] . The opening activation 
energy Eo = 0.74 eV is very close to MD = 0.80 eV which 
is the energy corresponding to the breaking of the M base 
pairs of the stem. 



5 Discussion, role of the model of the loop. 



The properties of the model can be examined in the light 
of experimental studies of DNA beacons which investi- 
gated the effect of the length and composition of the loop 
[6,22 . 

Figure [7] shows the variation of the opening tempera- 
ture Tm versus the length of the loop for two values of the 
parameter e that governs the rigidity of the Kratky Porod 
model, e = 0.0016 eV/A^ giving a persistence length A = 
10.82 A at 300 K (A/£ = 1.8) and e = 0.0012 eV/A^ 
giving a persistence length A = 8.05 A at 300 K {X/i — 
1.34). Measurements of the persistence length for single- 
stranded poly(T) DNA give values in the range 7.5 to 
13 A, depending on the salt conditions, with some mea- 
surements leading to values as high as 40 A ^23..24. . Single- 
stranded poly(A) can be expected to have a larger persis- 
tence length because adenine bases are larger than thymines. 
However, for short loops it may be difficult to draw a def- 
inite conclusion because some all- atom molecular dynam- 
ics simulations [25] show that the larger bases may be 
expelled from the inside of the loop due to steric repul- 
sions while the smaller ones may stay inside and stack on 
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each other. Paradoxically this could lead to a larger flex- 
ibility for a poly (A) loop than for a poly(T). This points 
out the difficulty to get reliable values of the persistence 
length from experiments that do not investigate the hair- 
pins themselves. However the values of e that we have se- 
lected are in the expected range for single-stranded DNA, 
and we assume that the larger value of e corresponds to 
poly(A). Figure [7] shows that, for a given loop length, 
Tjn decreases when the rigidity of the loop increases, in 
agreement with experiments [22]. Moreover, as observed 
experimentally, the melting temperature of the hairpins 
decreases with increasing loop length. For the model we 
obtain cx L'" with i/ w 0.12. 



= 0.0016 eV/A^ 



100 



80 



60 



40 



20 



240 260 280 300 320 340 360 380 400 
T(K) 

£ = 0.0012 eV/A^ 



100 
80 
60 
40 
20 



1 



240 260 280 300 320 340 360 380 400 
T{K) 

Fig. 8. Variation versus temperature of the percentage of open 
hairpins for two values of e and different loop lengths: L = 10 
(squares) L = 14 (circles), L — 24 (triangles) and L = 32 
(diamonds) . 



There are however two aspects on which the model 
quantitatively disagrees with experiments. First it gives 
a width of the melting transition which is significantly 
larger than in experiments. The model finds that the tem- 
perature range over which the percentage of open hair- 
pins varies from 20% to 80% extends above approximately 
50 K (depending on L) while experiments measure a range 
of about 15 K for poly(T) loops and about 30 K for 



poly (A). Second, as shown in Fig. [51 the model gives a 
variation of versus L which is approximately the same 
for poly(A) (e = 0.0016 eV/A^) and for poly(T) (e = 
0.0012 eV/A^), while experiments indicate that the ef- 
fect of the loop length L should be significantly larger for 
poly(A) than for poly(T). 



0.0016 eV/A^ 



lO"" 



10* . 



10' 



10^ 



,3 I ♦ 



10' 



2.8 2.9 3 3.1 3.2 3.3 3.4 3.5 3.6 3.7 
1000/T(K"^) 
e = 0.0012 eV/A^ 
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10^ 



10-^ 
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2.8 2.9 



3.1 3.2 3.3 3.4 
1000/T(K"') 



3.5 3.6 3.7 



Fig. 9. Variation versus temperature of the reaction rates for 
opening kf (open symbols) and for closing kr (closed sym- 
bols) for two values of e and different loop lengths: L = 10 
(squares) L = 14 (circles), L = 24 (triangles) and L = 32 
(diamonds). The reaction rates are plotted in logarithmic 
scale, versus 1000/T. The calculations have been made with 
Do = 3.0 10^ cmVs. 



Figure [9l shows the variation versus T of the reac- 
tion rates for opening kf and closing kr for different loop 
lengths, for two values of e describing poly(A) and poly(T) 
loops. As noted in subsection 13.31 the order of magnitude 
of the values that we obtain for the reaction rates are in 
agreement with the experimental results [6]. Another im- 
portant point is that kf \s nearly independent of the loop 
length (Fig. [9]) or loop sequence (Fig. [TOj), as observed 
experimentally. The variation oi kf versus T is well de- 
scribed by an Arrhenius law with an activation energy 
Eo ~ 0.74 eV (or 17 kcal/mol, while experiments report a 
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2.8 2.9 3 3.1 3.2 3.3 3.4 3.5 3.6 3.7 
1000/T(K"^) 



Fig. 10. Comparison of the temperature variation of the re- 
action rates for opening kf (open symbols) and for closing k,. 
(closed symbols) for two values of e: squares e = 0.0016 eV/A^ 
(poly(A)), circles e = 0.0012 eV/A^ (poly(T)). 

0.0016 I ' ' ' ' 1 




Fig. 11. Closing times of the hairpin at 300 K, Tr = k^ , ver- 
sus L for two values of e: squares e = 0.0016 eV/A^ (poly(A)), 
circles e = 0.0012 eV/A^ (poly(T)). The points are the numer- 
ical values given by the model and the lines are fits according 
to the formula indicated in the graph. 

higher value of 34 kcal/mol). Conversely the closing rate 
depends on the loop length or sequence. Lower rates are 
obtained for longer, or more rigid, loops, as one would 
expect qualitatively by considering that closing is mainly 
determined by the random diffusion of the two sides of 
the loops that bind when they find each other in space. It 
is interesting to examine the variation of the closing time 
Tr = versus the size of the loop L, shown in Fig. [TlJ 
It can be approximated by the scaling law 

Tr - To (42) 

with an exponent (j) — 2.09 for the poly(T) case and 
(f) = 2.22 for the more rigid poly(A) case. These values 
should be compared with the values (j)o = 2 for a Gaus- 
sian chain or = 1.8 obtained for a flexible polymer with 
excluded volume effects [26; . Our results that give a lower 
exponent (j) when e is reduced are consistent with this the- 
oretical predictions. Some experimental results report an 
exponent of 2 ± 0.2 ^275, but the scaling was measured 



on very small loop (4 < L < 12). While the closing rate 
depends strongly on the loop sequence, in the temper- 
ature range that we investigated it is well described by 
an Arrhenius law with an activation energy that depends 
weakly on the sequence. For L = 24, we get Ec = 0.148 eV 
(3.4 kcal/mol) for e = 0.0016 eV/A^ (poly(A)) and = 
0.128 eV (2.96 kcal/mol) for e = 0.0012 eV/A^ (poly(T)). 

The results presented up to now have been obtained 
by describing the DNA strands with a KP polymer model. 
This model is interesting because it allows us to describe 
the energetic effects associated to the bending of the strand. 
However, as we have seen that the results exhibit some 
limitations of the hairpin model, it is interesting to ex- 
amine the infiuence of the model chosen to describe the 
properties of the loop. FiguresfOltofTTlshow the results ob- 
tained if we consider the strand as a Freely Rotating Chain 
(FRC) ^28], i.e. a polymer made of segments of length £, 
such that two consecutive segments make a fixed angle 9 
but can rotate freely around each other (Fig. [T^ . The en- 
ergy of a FRC chain is a constant and the contribution of 
the polymer is only cntropic. 




Fig. 12. The Freely Rotating Chain polymer model. The angle 
between consecutive segments is fixed, and each segment can 
rotate freely around the axis defined by the previous one. 



The probability distribution function P^^^{p) of the 
FRC cannot be expressed analytically but it is easy to 
obtain it by a Monte Carlo simulation. This numerical 
expression can be introduced in the calculation of the 
constrained partition function according to Eq. (fTT|) . but, 
as for the KP chain, we need an analytical expression of 
S{p'\p) to carry out the calculations. Figure fT3l shows that 
it can again be provided by an effective Gaussian approxi- 
mation determined by choosing the parameter r according 
to Lr2 = {p^)/6. For the FRC, one has {p^} = L £^{1 + 
cos0)/(l — COS 9) so that the value of x to be entered in 
the expressions (l2Tt and (fT8)) is x = (l + cos0)/(l — cos0). 

In order to compare the two polymer models, we have 
selected for the FRC case values of 9 which give a persis- 
tence length comparable to the cases that we investigated 
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for the Kratky Porod model. The matching cannot be per- 
fect because, as the FRC model has a constant energy, its 
persistence length does not depend on temperature, con- 
trary to the KP case. We have selected the values of 9 so 
that the persistence length of the two models match at 
T = 300 K. For the FRC model, the persistence length is 



A' = -- 



ln(cos6l) ■ ^"^^^ 

The values = 54.945 ° and 6* = 61.667 ° give the same 
persistence lengths as the KP model at 300 K for e = 
0.0016 eV/A2 (poly A) and e = 0.0012 eV/A^ (poly T). 
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Fig. 13. Comparison between the FRC distribution function 
P£^'~^'{p) (full line) and the Gaussian approximation P^ip) 
(dashed line) for two values of L. The crosses show the dis- 
tribution function Pl{p) obtained by starting from the FRC 
distribution P[3f (p) and computing the probability distribu- 
tion of the end-to-end distance of a polymer extended by two 
units, using the conditional probability S{p'\p) according to 
Eq. H The parameters for the FRC model are 6 = 54.945 °, 
^ = 6 A, giving a persistence length A — 10.8 A. 



Fig. 14. Variation of the opening temperatures of DNA hair- 
pins deduced from the model where the loop is described by 
a FRC model for 6 = 54.945 ° (circles) and 6 = 61.667 ° 
(squares). The curves show fits with the function T = TqL" . 
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Fig. 15. Variation versus temperature of the percentage of 
open hairpins with a FRC model of the loop for two values of 
6 and difi'erent loop lengths: L = 10 (squares) L — 14 (circles), 
L — 24 (triangles) and L — 32 (diamonds). 



The comparison of figures [13 to [T71 for the FRC model 
with the corresponding figures with the KP model shows 
that most of the results are qualitatively similar for both 



models. The melting curves of Fig. [T5] for the FRC model 
exhibit a narrower temperature range for melting than the 
corresponding curves of Fig. [8] for the KP model, which 
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Fig. 16. Variation versus temperature of the reaction rates for 
opening kf (open symbols) and for closing fer (closed symbols) 
with a FRC model of the loop for two values of 9 and different 
loop lengths: L = 10 (squares) L = 14 (circles), L = 24 (trian- 
gles) and L — 32 (diamonds). The reaction rates are plotted in 
logarithmic scale, versus lOOO/T. The calculations have been 
made with Do = 3.0 10® cm^/s. 



0.0018 
0.0016 
0.0014 
0.0012 
0.001 
0.0008 
0.0006 
0.0004 
0.0002 



10 



15 



20 



I 2.09 



25 



30 



35 



Fig. 17. Closing times of the hairpin at 300 K, t — k,.^ , versus 
L for a FRC model of the loop for two values of 8: squares 
e = 54.945 ° (poly(A)), circles 6 = 61.667 ° (poly(T)). The 
points are the numerical values given by the model and the 
lines are fits according to the formula indicated in the graph. 



The effect of the size of the loop on the closing times 
of the hairpins at 300 K is very similar for the FRC and 
the KP models (Figs. [T71 and [TT|) because the parameters 
of the two models have been selected to give the same 
persistence lengths at this temperature. 

In summary, the comparison between alternative de- 
scriptions of the polymer properties of DNA single strands 
shows that different models can bring some quantitative 
differences but that the qualitative results are not changed; 
the main discrepancy between theory and experiments 
concerning the variation of T„i versus L for different loop 
lengths, which is greater for more rigid loops in experi- 
ments while the theory gives the opposite, does not seem 
to be resolved simply by using another polymer model. 



6 Conclusion 



would be closer to experimental observations. But both 
models show a larger variation of Tm when the persis- 
tence length of the loop is larger, which disagrees with 
the observations. 

Figures [16] for the FRC model and [9] in the KP case 
show the same general behavior that the opening rate kf 
is almost independent of the length of the loop, whereas 
the closing rate kr varies by more than one order of magni- 
tude when L changes from 10 to 32. But there is a qualita- 
tive difference between the FRC and Kratky Porod model 
(which is partly hidden by the logarithmic scales of the fig- 
ures) concerning the activation energy for closing. While 
it was of the order of 0.13 eV (3 kcal/mol) for the Kratky 
Porod model, in agreement with experiments, it is 5 times 
smaller for the FRC model (« 0.57 kcal/mol). This is 
consistent with the absence of any energy contribution in 
the FRC, whose rigidity is described in purely geometri- 
cal terms, and in fact points out the model's deficiency in 
describing the properties of DNA strands. 



In this study we presented a theoretical model of the 
physics of DNA hairpin formation and melting which tries 
to capture the essential phenomena within a highly simpli- 
fied picture. Basically it combines a model for the double 
helix assembly with standard polymer concepts. This ap- 
proach exhibits successes and weaknesses which are them- 
selves instructive for understanding the properties of DNA 
and RNA strands. 

Our mesoscopic approach provides acceptable system- 
atics for thermodynamic and kinetic properties of hair- 
pins with a poly(T) loop. Using realistic parameters for 
the binding energies, persistence length of the loop and 
diffusion coefficient of the polymer, the KP variant of the 
model describes the variation of versus L and the or- 
der of magnitude of opening and closing times. It shows 
that the kinetics of the opening is almost unaffected by 
the length of the loop, in agreement with experiments. 
Closing times increase very significantly for longer loops 
while the corresponding activation energy is almost inde- 
pendent of L, as observed experimentally. The enthalpy 
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of closing is quantitatively described while the enthalpy 
of opening given by the model is only half of the observed 
value. This aspect is related to the temperature range over 
which the melting transition is found in the model, which 
is significantly broader than in the experiments. Although 
the model is only semi-quantitative in some respects, it 
is nevertheless able to describe a whole set of equilibrium 
and non-equilibrium data with a small set of realistic pa- 
rameters. It should be stressed that studying thermody- 
namics and kinetics in the same framework is a rather 
demanding test. 



Weaknesses appear when one tries to apply the the- 
ory to poly(A) loops. The model correctly detects that 
Tm is lowered but it finds that the variation of Tm with 
L is smaller for poly(A) than for poly(T). Intuitively this 
makes sense because one can understand the decrease of 
Tfn with L as an entropic effect due to the fluctuations 
of the loop. As poly (A), with its larger bases is consid- 
ered to be more rigid than poly(T) [22J, which is reflected 
in the higher value of e that we introduce in the KP de- 
scription of poly (A), one can expect that this extra rigid- 
ity reduces fluctuations thereby decreasing the entropy 
gain due to loop extension. Thus basic physics leads to 
the conclusion that the effect of the loop length should 
be smaller for poly(A) than for poly(T) but experiments 
show exactly the contrary. Another discrepancy between 
our model and experimental data appears when one ex- 
amines the enthalpy for closing given by the kinetic stud- 
ies. Experiments find that AHc is approximately 5 times 
larger for poly (A) than for poly(T), while we only get a 
small increase when e is changed from 0.0012 eV/A^ to 
0.0016 eV/A^. Varying parameters one can increase AHc 
for poly(A) in the model, but the disagreement with exper- 
iments is transfered elsewhere, in particular on Tm- These 
discrepancies between theory and experiments for poly(A) 
are frustrating but probably also very instructive. They 
suggest that "rigidity" is not the only feature that dis- 
tinguishes poly (A) from poly(T), otherwise the KP model 
would be able to describe it. It appears that the effect 
of having large bases which can stack on each other is 
deeper and might not be captured by a simple polymer 
chain model. 



In conclusion, attempting to put the thermodynamic 
and kinetic properties of DNA hairpins in the same model 
framework remains a challenge. Our results show that the 
role of the loop is decisive and, for poly (A), extends be- 
yond a simple rigidity effect. This indicates that experi- 
ments on hairpins are very sensitive probes of the prop- 
erties of single-stranded DNA on a scale of a few tens of 
base pairs. In other words, beacons tell us not only about 
themselves but also about mesoscopic properties of single- 
stranded DNA and RNA, which have a high biological 
relevance. 



A Calculation of the conditional probability 

S{p'\p) for a Gaussian chain. 

Let us consider a Gaussian chain made of orientationally 
uncorrelated links such that the probability for any seg- 
ment to lie along a vector A is proportional to 
exp(— |/lp/4T^). The probability that the end-to-end dis- 
tance of a chain of L monomers is at distance p is then 



(44) 



with cr^ = Lt^ , where the p^ prefactor comes from the 
integration over all orientations of the end-to-end vector. 
It is such that (p^) = L = 6a^. Consider such a Gaus- 
sian chain with an end-to-end vector p, and assume that 
we add to each end segments Ai and A2. Its end-to-end 
vector becomes p' = p + Ai — Ai and the conditional 
probability that the end-to-end distance of the extended 
chain is p', given p, is 



S{p'\p) = A^p' J dAi dA2 



e"^^"+^"-^/^^"5{p' -p-A^ + A2) (45) 

where A is a normalization constant to be determined at 
the end of the calculation, and where the first integral over 
the orientations of p' is introduced because we are only 
interested in the end-to-end distance of the chain. The 
integration over A2 is immediate. Let us define u — p' —p. 
Up to a normalization factor we get 

S{p'\p)^Ap'^ j dQp, j dA^e-^"/^^" 



+1 



dp e 



where the integral over 



P- 



u ■ Ai 

u Ai 



(46) 



(47) 



comes is the integration over the azimuthal angle of Ai. 
This leads to 



S{p'\p)^Ap'^ I dQp,-e--''^^' 
J u 

/ dAi zlie^-^i/^^' sinh 
Jo 



uAi 



(48) 



up to normalization factors. Using the definite integral 

b pK 



J{a,b) 



dx X e sinh hx 



4a V a 



(49) 



we can perform the integration over Ai. Reintroducing 
u = p' p, and defining 



P'P 

V = — — 
P P 



(50) 
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we get 



S{p'\p) - Ap'^ / d77 e-(p'^+P^-2p'p.)/8r^ (5^) 



Performing the final integration, and determining the nor- 
malization constant from 



r dp's{p'\p)=i vp 

Jo 



(52) 



we obtain 



.(p'|p) = ,/J^^e-(P'^-Wsinh(0) . (53) 



B Calculation of the first passage time in a 
diffusion controlled process. 



We consider the Smoluchowski equation (j33p for the prob- 
ability distribution V{p,t), with a function J-'{p) which 
has the shape of a double well with a local maximum at 
p = p* . Initially the system is assumed to be in the well 
Po < p < p* and we assume a reflecting condition at the 
boundary p — p^, which implies j{po,t) — Vi. To de- 
termine the first passage time above the maximum at p* 
an absorbing boundary condition is assumed for p — p*. 
It can be expressed as j{p*,t) = KV{p*,t) and taking the 
limit K ^ oo. 

The probability that the system is still in the original 
well at time t is /^^ V{p,t)dp, so that the first passage 
time above the barrier is 



dt / dpV{p,t) 

•I Po 



(54) 



and has been calculated in References [12] and [2U] . For the 
sake of completeness we give an outline of the derivation 
[20] in the context of the present study. 

Integrating Eq. ^62\ with respect to p we get an expres- 
sion of j(p, t), which can be used to express the boundary 
condition at p* as 



j{p\t)^KV{p\t) 



and write Eq. ([33]) as 



Po 



dt 



■ dV{p,t) 
dp — — = 



Po 



dt 



dp dp 
d 



(56) 



Integrating over r in the range {R, p*) we get 

eP^^p'^V{p\t)-eP^^''^V{R,t) 
dV{p,t) 



dr 

Do e-P^(-) 



dp- 



dt 



(57) 



Using the boundary condition ([55)1 we obtain V{R,t) as 
V{R,t) 



I ^-I3:F(R) rp* Q-p 



Po 



dr 



In Doe-Pn,,^^ 
Let us define po (^) by 



dt 



J" dpe-P^ip) 



(58) 



(59) 



which is the probability that the system is at position r 
in the first well, weighted in this well so that it verifies 

//o Po{''')dr = 1. It leads to 



Kpo{p* 

Po{R) 



(60) 



■'po 

dr 



R Do Pair) 



dp 



dr{p,t) 

dt 



Using this expression to calculate r according to Eq. ([51)1 
gives 



1 



r''* f dr r 

/ dRpo{R) / dpV{p,t = Q) 

Jpo Jr Dopo[r) Jp„ 



dpV{p,t^ 0) + 
dr 



where we used limt^^V{p,t) = Vp. Now since the 
system is assumed to be at equilibrium in the well po < 
p < p* at t = 0, it follows from the definition of po{p) that 
'P{p,t^O)= po{p). Therefore 

1 dr r 

T= dRpo{R) — dppo{p), 

KPoip*) Jp„ Jr DoPo{r) Jp^ 

(61) 



Let us define H{r) by 



DoPoir) Jp^ 



(62) 



(55) We have 



1 



npoip*) 
1 

npo{p*) 
1 

KPoip*) 



rP* fP* 

/ dRpo{R) I drH(r)0(r-R) 

J Po J Po 

r dr H{r) r dRpo{R)0{r-R) 

J Po J Po 

P* r'r 

dr H{r) / dRpo{R) (63) 

n Po 



where 0{x) is the Heaviside step function. If we replace 
H{r) by its expression l|62|) . we obtain 



Kpo(p*) 



dr- 



Do Po{r) 



dR po{R) 



(64) 
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Taking the limit n ^ oo corresponding to the absorbing 
boundary condition when the system escapes above the 
barrier, we finally obtain 

r = /' dr—\-I^ir) , (65) 

with 

/(r) = / dRpo{R) , (66) 

J Po 

which is the resuh of Eqs. and (l35|l . 
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